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Abstract 

We investigate the statistics of the phenomenologicaUy important D-brane sector of string 
compactihcations. In particular for the class of intersecting D-brane models, we generalise 
methods known from number theory to determine the asymptotic statistical distribution 
of solutions to the tadpole cancellation conditions. Our approach allows us to compute the 
statistical distribution of gauge theoretic observables like the rank of the gauge group, the 
number of chiral generations or the probability of an SU{N) gauge factor. Concretely, we 
study the statistics of intersecting branes on and and x Z2 orientifolds. 

Intriguingly, we find a statistical correlation between the rank of the gauge group and 
the number of chiral generations. Finally, we combine the statistics of the gauge theory 
sector with the statistics of the fiux sector and study how distributions of gauge theoretic 
quantities are affected. 
11/2004 



1. Introduction 

For many years, string ttieorists liave studied various kinds of string compactifications 
with the main motivation to find a model one day which resembles the Standard Model of 
particle physics, which we know is a good effective description of nature around the weak 
scale. Of course, it is very compelling that string theory in some sense predicts (or at least 
contains) gravity and at the same time also gauge theory, but it turned out to be a very 
difficult problem to be more precise about the predictions for the gauge theory sector at 
low energies. There were simply too many possible candidates which at best resembled the 
Standard Model in certain aspects but not in its full glory. However, one could also say 
that if we had constructed the Standard Model already in a few attempts, we could have 
simply been very lucky or more probably would have just found one of a vast number of 
stringy realizations of the Standard Model. 

Recall that in the eighties the weakly coupled heterotic string was thought to be 
the best candidate to deliver a realistic model. Various (supersymmetric) constructions, 
like orbifolds, bosonic lattices, free fermion constructions, Calabi-Yau manifolds in toric 
varieties. Landau- Ginzburg and Gepner models etc. were studied and partly classified in 
a large number of publications. Even today it is not clear whether there exist finitely or 
infinitely many heterotic string backgrounds. 

With the advent of D-branes in the mid nineties, new classes of phenomenologically 
attractive string compactifications were found where the gauge theory is realized by the 
open strings ending on D-branes. Different constructions were proposed like orientifolds 
with branes on singularities or intersecting D-branes. It was also realized that the string 
scale does not necessarily have to be close to the Planck scale, as in D-brane constructions 
low string scale models can exist and would lead to a completely new phenomenology in 
the TeV range [|l|]. In this case, supersymmetry would no longer be necessary for solving 
the gauge hierarchy problem. 

In addition, with the extension of the relevant dimensions to eleven and even twelve, 
new geometric compactifications were possible like M-theory on G2 manifolds or F-theory 
on Calabi-Yau fourfolds. Since among these different classes of string models various S and 
T dualities operate, not all constructions were considered to be independent, but rather 
provide descriptions in different regimes of the M-theory moduli space. It was clear by 
then that there exist very many string vacua, but nevertheless people hoped that maybe 
one could find a realistic model out of these, say, 10^° vacua 0. 

^ Numbers of this order appeared for instance in the classification of CY fourfolds [||. 
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Moreover, with the study of flux compactifications some additional progress 

was made: First, flux compactifications allowed to solve some of the problems the former 
purely geometric models notoriously had. In particular, certain fluxes induce an effective 
potential that still possesses supersymmetric minima, which allows to freeze (some of) the 
moduli generically appearing in string theory. Second, it was also possible to break super- 
symmetry in a controlled way by turning on additional internal flux components. Finally, 
by taking also some non-perturbative effects into account, for the first time strong evidence 
was given that non-supersymmetric meta-stable de-Sitter vacua do exist in string theory 
||^,PI, [TO|JTl]] . Concerning the classification of string vacua, these flux compactifications were 
also quite surprising, as it turned out that there exist astronomical numbers of minima 
of the flux induced potential. One is now not talking about 10^'^ vacua but about 10^°° 
different vacua in the string theory landscape. 

M.R. Douglas emphasized that the search for the Standard Model like vacuum does not 
really make sense in view of this huge number of string models [|12[. Given the precisions 
with which the Standard Model parameters are measured there probably exists still a large 
amount of stringy realizations consistent with these precisions. Of course, finding one of 
these string models would still be a big success from the phenomenological point of view, 
as once the background is fixed there probably exist fewer parameters than in the Standard 
Model. 

Contrarily, M.R. Douglas proposed a different complementary approach to the string 
vacuum problem. Given the huge numbers which are occurring, one had better try a 
statistical approach. Instead of computing phenomenologically important quantities for 
each model separately, we ought to compute expectation values or distributions of these 
quantities in the ensemble of string vacua. Based on this approach, it was proposed that 
some of the fine tuning problems like for the cosmological constant or for the weak scale 
can find a purely statistical explanation on the string landscape. If the amount e of fine 
tuning is larger than the density of the supposedly uniformly distributed string models, 
e > there can still be a sufficient number of string vacua giving rise to the right value 
|T^,|n|,|T^ . Of course, questions like why nature has chosen a specific, maybe even unlikely 
value ask for a vacuum selection principle and are beyond the statistical approach. Some 
authors have suggested that eventually the weak anthropic principle might be the only 
clue for "explaining" the actual values of certain physical constants [|T6|jr^ . Whether this 
is the case, we don't know, but we think it is not yet time to give up searching for other 
explanations. 
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If also the second fine tuning problem, the gauge hierarchy problem, is merely solved 
statistically, then there would be no need for low energy supersymmetry breaking. There 
is still an ongoing debate of whether the string landscape favours a low or a high super- 



symmetry breaking scale [|T8| - pO| , but inspired by these string landscape ideas already new 
phenomenological models with high scale supersymmetry breaking have been proposed 
||2l|] . It was also suggested that branes with gauge fluxes respectively intersecting branes 



might provide a stringy realization of these so-called split supersymmetry models |2^. On 
the other hand, if one is still insisting in low scale supersymmetry breaking within the 
observable sector of the supersymmetric Standard Model, flux compactifications allow for 
an explicit computation of the soft supersymmetry breaking parameters (gaugino masses, 
squark and slepton masses etc.) in semi- realistic models with D3- and/or D7-branes [ p3| . 

At the moment we do not know whether string theory is realized in nature. Without 
having a proof that string theory is in a mathematical sense the unique quantum theory 
of gravity, we can only hope to find support for string theory experimentally. Progress in 
this direction is as we know hampered by the huge vacuum degeneracy. 

It was proposed that performing statistics of the string theory landscape could improve 
our understanding of the relevance of string theory in nature. Indeed, one could imagine 
that two different physical observables are statistically excluded so that effectively they 
are never realized together on the string theory landscape. Then this would practically 
falsify string theory. On the other hand one could imagine that by computing statistical 
correlations of physical observables on the string theory landscape we find more and more 
support that our universe satisfies these correlations. Of course this would not be a proof 
of string theory but we could hope to get more and more support from such considerations. 

However, there exists also the possibility of the depressing outcome that the statistical 
analysis does not show any strong correlations among various quantities so that it does 
not lead to any strong statements at all. In this scenario, all values of physical quantities 
are more or less uniformly realized and everything is equally likely. On the one hand, 
such uniform distributions are an advantage for solving some of the fine tuning problems 
and allow one to argue that Standard-like models should exist on the landscape, on the 
other hand they are rather boring, as they do not yield any at least statistical prediction. 
Therefore, it should be one of the major goals of a statistical analysis to reveal statistical 
correlations among physically relevant quantities. 

So far the main and most successful work on the statistical approach deals with the 
fiux induced F-terms and how they freeze the complex structure moduli. In fact, in ^4 
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a formula was derived for counting the index of vacua of the flux induced superpotential. 
This formula was successfully tested in [PB|J2B] for two different Calabi-Yau manifolds with 



one respectively two Kahler moduli. Phenomenologically it is of course very important 
to also study the statistics of the gauge theory sector of string compactifications. Some 



general comments and estimates were made in the original work [|T2|. 

The aim of this paper is to develop more refined methods to work out the statistical 
distributions of physically interesting quantities in the gauge theory sector of string vacua. 
In the original Type IIB flux models the only D-branes were parallel D3-branes, on 
which no interesting physics like chirality can occur. It was then proposed to augment this 
set-up by more general branes, like branes on singularities or branes with non-trivial 2-form 
gauge fluxes (intersecting branes in the T-dual picture). It is known that these intersecting 



D-brane models are constrained by only fairly moderate consistency conditions p7| , p8| , p9| 
which still allow for a large number of solutions. This makes them good candidates for a 
statistical approach. Note that the statistics of D3-branes was discussed in . 



In this paper, given a closed string background, we will present a method to count 
classically different solutions to the tadpole cancellation conditions. As it stands, this is a 
purely number theoretical problem. It turns out that the counting of unordered partitions 
is very similar to this task. It is precisely the saddle point method working quite well in the 
former case which can be successfully carried over to our problem. It allows us to compute 
the statistical distribution of various gauge theoretic quantities and to reveal statistical 
correlations among quantities which from the gauge theory point of view are completely 
unrelated. Therefore, this can be considered as a purely string theoretical statistical effect 
and, of course, can be traced back to the fact that string theory is a more constrained 
system than gauge theory. 

This paper is organised as follows. In section 2 we formulate the problem of counting 
supersymmetric solutions to the tadpole cancellation conditions arising in intersecting 
D-brane models of the Type IIA string theory. Note that via T-duality these models are 
related to Type IIB orientifolds with even dimensional branes with non-trivial gauge fluxes 
turned on. Therefore, as worked out in [pli|32|j3^ , we can combine these models with the 



Type IIB flux compactifications of 0,^. In a first approximation, the brane and the fiux 
sector are quite independent, the latter giving rise to F-term potentials with the former 
leading to D-term potentials. The only effect of the fiux is a contribution to the R-R 
4-form tadpole cancellation condition. 



In section 3, as a toy model we study the analogous eight dimensional problem where 
we have only compactified on a two-dimensional torus. This model is sufficiently simple, 
but still close enough to our actual problem to allow for a clear explanation of our method. 
In addition, for this simple model we also perform a brute force computer search for all 
solutions to the tadpole cancellation condition. Note that the semi-analytic saddle point 
method we will propose is much more effective and far less time consuming than this brute 
force numerical classification. 

In section 4 we will generalise the method to six-dimensional models on ||34 



where new aspects like chirality and complex structure dependence arise. We will derive the 
statistical distribution for various quantities of interest like the total number of solutions, 
the probability of an SU (M) gauge factor, the distribution of the rank of the gauge group 
and the distribution of the number of families. These 6D results are again supported by 
a numerical computer search for all solutions. Intriguingly, we find a correlation between 
the rank of the gauge group and the number of families. 

In section 5 we come to the physically most interesting example and compute the 
same distribution for the case of intersecting branes on the jTL^ x ^2 orientifold p8| , p5| . 
It turns out that this case is even more complicated than the 6D case, mainly because su- 
persymmetry also allows for negative contributions to the tadpole cancellation conditions. 
We will show that the number of solutions to these conditions for fixed orientifold and 
3-form fiux contributions is finite. Surprisingly, all three different models in 8D, 6D and 
4D qualitatively show the same behaviour, so that one can suspect that the statistics of 
intersecting D-brane models in general is already captured by our concrete examples. For 
this case, some of the computations we did in 8D and 6D are already very time consuming. 
Therefore, we restrict ourselves in this paper to a selection of those statistical distribu- 
tions which could be computed in reasonable time, while postponing a more thorough 
investigation to a future publication [l36|| . 

In section 6 we combine the gauge theory sector distributions computed in section 
5 with the statistics of fiux compactifications and analyze how taking both effects into 
account changes the statistics. In particular, we focus on the dependence of the distribution 
of the rank of the gauge group on the number of 3-cycles where one can turn on three-form 
fluxes. 

Finally, in section 8 we give our conclusions and an outlook of what kind of statistical 
questions one might want to approach next. We add three appendices, in the first two of 
which we derive the conditions for coprime wrapping numbers in 6D and 4D. The third 
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appendix briefly summarises the main technical tools we are using. They are expected to be 
applicable to any problem of the same sort. Whenever one wants to analyse the statistics 
of solutions to integer valued constraints, which are known to have a large number of 
solutions, a variant of our number theoretic formulas should work. 

2. Counting solutions to tadpole cancellation conditions 

In this section we formulate the problem of counting respectively computing various 
statistical distributions of phenomenologically relevant observables in the set-up of stringy 
supersymmetric intersecting D-brane models. Even though we will also discuss eight and 
six-dimensional models, let us explain the problem for the more relevant four-dimensional 
case. 

2.1. The statistical problem for general Type II orientifolds 

For concreteness we start with a Type IIB orientifold flux compactification, where 
we also allow all fluxes to vanish. The F-term scalar potential generated by the fluxes 
generically freezes the complex structure moduli and the dilaton. The techniques developed 
in []T^ , [2^ , P7| , |SS| , P^ , PB| can be used to perform a statistical analysis of the ensemble of these 
flux vacua, which for instance allows one to address questions about the distribution of 
the cosmological constant and the scale of supersymmetry breaking. 

In addition, the tadpole cancellation conditions also require the presence of D-branes, 
which from the phenomenological point of view are at least equally important, as they 
provide the stringy realization of the particle physics with all its interactions. It has been 
pointed out that to get interesting gauge theories with chiral matter content, one has to 
introduce more general branes than simply D3-branes. Both the possibility of branes on 
singularities and branes with non-trivial gauge fluxes have been proposed |]3T| , ^ . 

If one neglects the back-reaction of the fluxes on the background, i.e. for models 
at large radii with diluted fluxes, the D-brane sector is more or less independent of the 
flux sector. The only effect is the contribution of the fluxes to the R-R 4-form tadpole 
cancellation condition. 

We would like to perform a statistical analysis for the D-brane sector as well, where 
just for convenience we work from now on in the T-dual (mirror symmetric) framework 
of intersecting D-branes. Therefore we consider a Type IIA string compactiflcation on 
some Calabi-Yau manifold, M, and divide the Type IIA model by the discrete symmetry 
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Oa. Here fl denotes the world-sheet parity transformation and a an anti-holomorphic 
involution, which in the following is chosen to be simply complex conjugation in local 
coordinates. It is well known and has been studied in many examples that this quotient 
introduces topological defects in the background, so-called orientifold 06-planes, which 
carry tension and charge under the R-R seven-form. For dimensional reasons these 06- 
planes wrap 3-cycles, ttos, in the Calabi-Yau manifold and since they preserve one-half of 
the supersymmetry these 3-cycles are special Lagrangian (sLag). 

To cancel the induced tadpoles of the 06-planes one introduces -D6-branes, which 
by themselves wrap in general different 3-cycles. Specifically one introduces k stacks of 
different branes, where on each stack we have A'^^ -D6-branes wrapping the 3-cycle T^a and 
its Qa image tt^. Then the R-R tadpole cancellation condition takes the very simple form 



where we have also included the contribution of the flux. If we also require Af = 1 
supersymmetry in four dimensions the branes have to wrap sLag 3-cycles, preserving all 
the same supersymmetry. 

Similarly to the Type IIB orientifolds discussed in the homology group Hs{M) 
splits into an Oa even and odd part, Hs{M) = H^{M) (B H^{M). The even part contains 
real 3-cycles and the odd part completely imaginary ones. Moreover, fla exchanges the 
holomorphic and the anti-holomorphic 3-forms, so that the volume form 



is anti-invariant, i.e. Qa : vol(M) — vol(M). Therefore, the only non- vanishing inter- 
sections are between 3-cycles from H^{M) and H^(M). 

Let us introduce a symplectic basis {ai,(3i) of Hs{M,'^) where aj G H^{M) and 
Pi G H^{M), so that aj (1 (3j = 5ij with the other intersection numbers vanishing. We 
expand the 3-cycles of the branes and the orientifold planes as 



k 




(2.1) 



vol(M) = -O3 A O3 



(2.2) 



b3/2 



1=1 
&3/2 




(2.3) 




&3/2 



7=1 
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Therefore we get 63/2 = 1 + /i2i tadpole cancellation conditions 

k 

Na Xa,I = Ll- Lljlu^. (2.4) 



a=l 



Note that in F-theory compactifications the contribution to the 4-form tadpole is given by 
Lq = x/24 which can be larger than 32 and of order 10^. Even though in this paper we 
have a concrete model in mind, we often perform the analysis for more general values of 
L/, as we expect the statistics not to depend too strongly on the details of the model. The 
sLag condition 

3(03)k. =0 (2.5) 

takes the form 

&3/2 

Y,Y^jFj{U) = Q. (2.6) 

7=1 

Here Fj = J^^ O3. We also have to exclude anti-branes, leading to the additional condition 

3?(03)U. >0, (2.7) 



which can be expanded as 



^3/2 

J2Xa,lUl>0. (2.^ 
7=1 



The homogeneous complex structure coordinates are defined as Uj = f^^ O3. 

In general, two D-branes wrapping some 3-cycles in M have a non-trivial intersection 
number given by 

lab = Y^^ajYbJ-YajXbJ. (2.9) 
7 

It is well known that on the intersection locus we find lab chiral multiplets in the bifun- 
damental representation of the U{Na) x U (Nb) gauge symmetry supported on the pairs 
of branes. Chiral fermions from the la'a intersections transform in the symmetric and 
antisymmetric representation respectively. 

The aim is to count solutions to the tadpole cancellation conditions ( ^.41 ) under the 
constraints ( |2.6| ) and (|2.8| ). Here we assume that the contribution from the orientifold 
planes and possible other sources are fixed. Note that for a specific class of special La- 
grangians the wrapping numbers Xj and Yj are no longer independent so that it might 
indeed happen that the number of solutions to these equations is finite. In fact, the question 
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about finiteness of solutions splits into two parts. First, are there finitely many solutions 
for fixed Lj and complex structures Uj7 Second, up to moduli space identifications of 
solutions, are there only finitely many non-trivial complex structures allowed for fixed L/? 

Of course we are not only interested in the actual number of solutions and how they 
scale with the Lj, but also in computing statistical distributions of physically more inter- 
esting quantities like the rank of the gauge group, the number of families, the number of 
Standard-like models, the number of GUT like models, etc. It would also be particularly 
interesting to observe statistical correlations between different observables such as the rank 
of the gauge symmetry and the number of families, as after all these might turn out to be 
the only string theoretic predictions possible to detect on the string theory landscape. 

We will develop techniques to analytically respectively semi-analytically get a hand on 
the statistics of these brane solutions. To be concrete we will discuss our ideas elaborating 
on the examples of intersecting D-brane models on the T^, ^ and ^ ^ ^ backgrounds. 

2.2. Background on concrete toroidal models 

Let us summarise how these concrete examples as discussed in the literature fit into 
the general scheme introduced in the previous section. 

• M = T2 

For compactification on T^, a special Lagrangian submanifold is specified by two 
wrapping numbers {na,rna) around the fundamental 1-cycles. In this case these numbers 
are precisely identical to the numbers (X^, Ya) introduced in the previous section. The 
supersymmetry condition ( ^.61 ) becomes Ya = 0, which is independent of the complex 
structure U = R2/R1 on the rectangular torus and implies that all supersymmetric branes 
must lie along the x-axis, i.e. on top of the orientifold plane. The second supersymmetry 
condition (|2.8| ) becomes Xa > 0. If we did not allow for multiple wrapping (as is usually 
done in this framework), there would only exist one supersymmetric brane, namely the 
one with (X, Y) = (1, 0). Of course this is quite boring and in order to study the statistics 
of this model and to develop general tools we will allow multiple wrappings. Finally, the 
tadpole cancellation condition reads 

J2NaXa = lQ. (2.10) 
a 

22 
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In this class of special Lagrangian bulk branes is given by so-called factoris- 

able branes, which are similar to the former case defined by two pairs of wrapping num- 
bers (n/,m/). The wrapping numbers {Xi,Yi) with / = 1,2 for the ^2 invariant two- 
dimensional cycles are then given by 

Xi=nin2, X2 = mim2, 

(2.11) 

Yi=nim2j Y2 = min2. 

Note that these branes do not wrap the most general homological class, for the 2-cycle 
wrapping numbers satisfy the relation 

XiX2 = YiY2. (2.12) 

However, for a more general class we do not know what the special Lagrangians are. Via 
brane recombination it is known that there exist flat directions in the D-brane moduli 
space, corresponding to branes wrapping non-flat special Lagrangians. To avoid these 
complications, in this paper we stick to the well understood branes introduced above. 
However, keep in mind that since there are many more branes possible, the statistical 
estimates we are going to present in this paper are actually a lower bound. We believe and 
it remains to be proven that the qualitative features we derive from our analysis of this 
restricted but still large set of branes are representative. 
The untwisted tadpole cancellation conditions read 

J2NaXa,l = 8, 

(2.13) 



Note that in contrast to for instance the model discussed in , for simplicity we are only 



considering bulk branes without any twisted sector contribution (see |^T|] for a treatment 
of fractional branes in this framework). Defining O2 = {dxi + iUidyi){dx2 + iU2dy2) the 
supersymmetry conditions become 



Ui Yi + U2Y2 = 0, 
X1-U1U2X2 > 0. 
The intersection number between two bulk branes has an extra factor of two 



(2.14) 



l-ab 



2 {Xa,l Xt,2 + Xa,2 ^6,1 - l"a,l ^,2 " l"a,2 ^,0 • (2-15) 
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(2.16) 



Here the situation is very similar. One introduces three pairs of wrapping numbers 
{ni,mi), I = 1,2,3, in terms of which the 3-dimensional wrapping numbers can be ex- 
pressed as 

Xo = nin2n3, Xi=nim2m3, X2 = min2m3, Xs=mi'm2n3, 

Yo = 171117121713, Yi= 1711172173, ¥2=17x1712173, 1^3=711 77-2 ma. 

These are again not independent but satisfy a couple of relations 

Xi Yi = Xj Yj for all /, J, 
XjXj = YkYl, 
Xl {YlY = XiXjXk, 

YL{XLf = YiYjYk fovaWI^J^K^L^I, 
which by themselves are related. The tadpole cancellation conditions read 

J2NaXa,0=8, 



(2.17) 



Y,NaXa,i = -8 With 7 = 1, 2, 3. 



^2.18) 



Let us define the complex structures Uj = R2,i/Ri,i- Using O3 = Y[^i=i{dxi + iUidyi) and 
defining Uj = UjUk with I ^ J ^ K ^ I one can express the supersymmetry conditions 
as 



(2.19) 

Xo-5^X,t/, >0. 



3. Statistics of an eight dimensional toy model 

As a simple toy model showing not all the complications of the Calabi-Yau case, 
we discuss intersecting branes on T^. Recall that in this case there is only one tadpole 
cancellation condition which reads 

k 

Y,NaXa=L. (3.1) 

a=l 



The task is now to count all unordered solutions to (|3.1|) with arbitrary number of stacks 
k. This is a number theoretical problem, for which we have not found any solution in the 
literature. 
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3.1. Counting partitions 

We proceed by first discussing an easier but apparently related problem, namely to 
count unordered solutions (i.e. after dividing out permutations) to the constraint 

k 

Y,Na = L. (3.2) 

a=l 

This is nothing else than to count the number of partitions of L. In particular, we would 
be happy with a rough estimate for large L. To make this paper self-contained let us 
discuss in some detail how one can arrive at such an estimate by using the saddle point 
approximation (SPA), a well known and often applied technique from the mathematical 
theory of asymptotic approximations [|2],|3|. 

Let us discuss two slightly different approaches to this problem. First note that the 
problem of counting partitions is equivalent to counting solutions to the equation 

oo 

^/cnfc = L, (3.3) 

k=l 

where the ordering issue is solved automatically. Now one writes 



fcrtfe — L,0 

sol all 

^ / d<i^ E (3.4) 

^ nfe=0 

1 

2^i 



The last line simply extracts the order L term in the generating function for the partitions, 
which is the inverse of the well known Dedekind 77-function. A common method to evaluate 
the asymptotic expansion of such integrals is the saddle point approximation. Since we 
intend to apply this method also for counting string vacua, let us discuss it in some more 
detail. 

First we write 

M{L) = — i) dqef^'i^ (3.5) 



27ri _ 
with 

00 

f{q) = -J2 log(l - - + 1) log 9- (3.6) 



fc=i 
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To evaluate this integral one assumes that the main contribution comes from the saddle 
points qo, which are determined by the condition df /dq\q^ = 0. Here we assume that there 
exists only one saddle point. Otherwise one has to sum over all saddle points. Setting 

q = pexp{iLp) we get 

X{L) = — [ dipqef^'i\ (3.7) 
The leading order saddle point approximation is then simply given by 

ArW(L) =e-^('^"). (3.8) 

One can compute the next to leading order approximation by Taylor expansion in ip 

^2 



fiPo,V>) = fiQo) + nTT^Uo + • •• 



(3.9) 



and inserting it into (|3.7|) . Using that {d'^f/d^p 



iqo 



q {d f /dq )qf^, we obtain 



27r J-qoTT 

with X = qo(f. Thus for a sufficiently large second derivative one finally obtains 

1 gfiQo) 



(3.10) 



9g2 I go 



(3.11) 



For later use we also give the analogous result for the saddle point approximation 
when more than one variable is involved. Here one wants to approximate an integral 



1 /■ " 



(3.12) 



where / in general is of the form 

f{q)=9{q)-Y.{Li + l) \ogqi. 



(3.13) 



The saddle point is determined by the condition V f{q)\qn = 0, which after Taylor expand- 
ing f{q) around the saddle point leads to the second order saddle point approximation of 

(EH) 

Ar(2)(L) = , ' (3.14) 



det 



dqidqj 



-I go 
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Now, let us first analytically evaluate the leading order contribution of ( |3.4|) . One 
notices that for large L, the integrand quickly approaches infinity both for q < 1 and for 
g ~ 1, so that one expects a sharp minimum somewhere very close to g ^ 1. Therefore we 
write the integrand as exp(/(g)) with 

f{q) = log rr^] -(L + l) logq. (3.15) 
Close to ~ 1 we can write the first term in ( |3.15|) as 



(3.16) 




m>0 



SO that (|3.15|) becomes 



f{q)^^j^^-{L + l)logq. (3.17) 

The minimum of this function is approximately at go — 1 — \f^j so that fiqo) — 7rA/2L/3. 
To summarise, a first estimate of the growth of the partitions for large L is given by 



Ar(L) ~ e^^V^. (3.18) 

This is precisely the leading term in the celebrated Hardy-Ramanujan formula for the 
asymptotic growth of the number of partitions 

m)HR^j^e-V^\ (3.19) 

As in the more complicated examples we are going to discuss in this paper an analytic 
approach is not always feasible, let us also numerically evaluate the saddle point conditions 
and compare the exact, the leading order and the next to leading order approximations. 
The resulting three curves for Ln(A/'(L)) are shown in Figure 1. 

This shows that the second order saddle point approximation provides a very good 
estimate for the behaviour of the number of partitions. To illustrate this even more we 
have listed in Table 1 the values for the number of partitions resulting from an exact com- 
putation, the Hardy-Ramanujan formula and the second order saddle point approximation. 
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Ln ( N (L) ) 



15 : 
12.5- 




Fig. 1: Number of partitions: The solid line shows the exact number of 
partitions, the upper dotted line the leading order saddle point approximation 
and the lower dotted line the second order. 

It is quite impressive that the numerically evaluated saddle point method is even better 
than the Hardy-Ramanujan formula. 



L 


M{L) 


M{L)hr 


AA(2)(L)sPM 


1 


1 


1.88 


0.73 


2 


2 


2.72 


1.51 


3 


3 


4.09 


2.65 


4 


5 


6.10 


4.33 


5 


7 


8.94 


6.72 


6 


11 


12.88 


10.09 


7 


15 


18.27 


14.75 


8 


22 


25.54 


21.10 


9 


30 


35.25 


29.65 


10 


42 


48.10 


41.06 



Table 1: Comparison of asymptotic approximations for M {L) 

Recall that in order to get this nice result we reformulated the problem of finding 
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solutions to (p7^). In view of our original goal to find unordered solutions to the tadpole 



cancellation condition ( |3.1|) , it would be nice if we could also count the number of partitions 
using the original equation 

k 

J2Na=L. (3.20) 

a=l 

Here the number of stacks is a free parameter over which we have to sum eventually. Since 
we are only interested in unordered solutions, a rough estimate would be that we divide 
the number of ordered solutions by k\. Of course this too heavily suppresses solutions 
when some of the A'^^ are equal. However, introducing the precise combinatorial factors is 
partition dependent and does not simplify the problem. Therefore, let us see how far we 
can get with the naive k\ factor. Proceeding now analogously to the former case, we write 
the total number of solutions as 

^ fc=l Afi=l Nk = l ^ fc=l \Ar=l 




Applying again the saddle point method yields 



(3.21) 



f{q)^-^-iL + l)logq (3.22) 
1-q 

and leads to the final analytic result for the leading order approximation 

AfiL) ~ e^^. (3.23) 

We conclude that in this approximation we get the right growth and only the numerical 
coefficient is slightly smaller 

-A^ = -= ^ 1.28. (3.24) 

logM V6 

Now we can also numerically determine the saddle points and compute the leading and 
next to leading order results. These are shown in Figure 2. 

As expected we realize that for the second order approximation the curve lies below 
the exact result, but at least qualitatively gives a reasonable result. The difference is due 
to the fact that the l/k\ combinatorial factor is responsible for too large a suppression of 
partitions with identical terms. However, note that it was precisely this crude factor which 
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Ln ( N (L) ) 




L 



100 200 300 400 500 



Fig. 2: Saddle point approximation: The solid line denotes the exact num- 
ber of partitions, the dotted line the leading order SPA and the stared line 
the second order. Moreover, we have shown with triangles the second order 
approximation with the analytic factor 1.28 included. 

allowed us to proceed with the computation in the first place and to simplify the integrals 
considerably. Keeping in mind that we cannot expect to obtain always quantitatively 
completely accurate results, from now on we apply the saddle point method for counting 
solutions to the tadpole conditions. What we are mainly aiming at is to get qualitatively 
correct pictures of the distributions of various physical quantities. Comparing our results 
with a brute force computer search reveals that also the quantitative agreement in certain 
cases is fairly good. 

3.2. A comment on counting rigid branes 

The string models we are going to consider in this paper mostly have the feature that 
the D-branes allow for continuous displacements and Wilson lines, i.e. the cycles they 
wrap are not rigid. For more realistic models it is however desirable to get rid of the extra 
adjoint scalars and consider rigid cycles instead i.. Then the question arises whether this 
drastically changes the statistics of the D-brane sector. As an indication that one does not 

^ We would like to thank M.R. Douglas for encouraging us during the "String Vacuum Work- 
shop" in Munich to study the effects discussed in this section and in section 3.7. 
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expect this to happen, at least qualitatively, let us consider the analogous question for the 
related problem of counting partitions. 

For rigid branes, one is not allowed to have two stacks with identical wrapping num- 
bers. Analogously, we would like to count unordered partitions where each term can only 
appear once. This is like Fermi statistics and the exact expression for this number Q{L) 
takes the familiar form 

2^^) = i/^^^n(l + ^')- (3-25) 
Let us write the integrand as exp(/(q')) with 

f{q) = log + q'^)j-{L + 1) log q. (3.26) 

Close to Q ~ 1 we can write the first term in ( |3.26|) as 

log(llil + q')]=J2log{l + q')= Y: 

\ fc / k k,m>0 

_J. >^ (-1)"^+! _ 7r2 1 



qkm 



(3.27) 



(1-q) ^ m2 V2{\-q\ 



Proceeding in the same way as above, we find that the minimum of this function is ap- 
proximately at (Jo — 1 — \J so that 

Q{L) ~ e^v^. (3.28) 

Therefore, in the leading order saddle point approximation Af{L) and Q{L) show qualita- 
tively the same exponential growth with ^/L and 

logA/" r- 

= V2. 3.29 

log Q 

The leading order saddle point approximation gives the exponential term in the well known 
asymptotic expansion for Q{L) 

1 



Q{L) ^ e-V V3. (3.30) 

4.34 L4 

We consider these results as evidence that qualitatively the distributions of physical ob- 
servables do not change for rigid branes. 
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3.3. Counting tadpole solutions 

Now that we have convinced ourselves that, from a pragmatic point of view, the 
second approach from section (3.1) gives already a good estimate, we want to apply the 
same method to count the solutions of the tadpole condition 

k 

Y,NaXa = L. (3.31) 

a=l 

By the same reasoning we compute 

^ k=l Nt_ = 1Xi = 1 N, 

so that the saddle point function / reads 

q 



(3.32) 



L X 



Q 

x=i ^ 



Close to Q ~ 1 we find the analytical expression 

L 



f^^) = E -(L + l) logq. (3.33) 



f{q) ^ J_ ^ 1 - L logg ^ ^ - L logg. (3.34) 

In this approximation the saddle point is at qmin = 1 — \f^^^ so that 

U{L) ~ eV^i°g^. (3.35) 

A rough intuitive understanding of this result can be gained as follows. In order to solve 
( p.31| ) one first divides L into its partitions and then one writes each term as a product of 
two positive integer numbers. We know already that the number of partitions scales like 
g2\/r ^Yyq other hand it is known in number theory that the function ao(n) of divisors 
of an integer n has the property 

1 ^ 

- J] ao(n) ^ logL + (27^; - 1), (3.36) 

n=l 
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where 7^; denotes the Euler-Mascheroni constant. We think that this explains the appear- 
ance of the (logL) factor in ( |3.35|) . Of course, further L-dependent corrections wiU arise 
from the sub-leading second order terms in the SPA. 

In order to test these results one can now compare the three different approaches to 
unravel the statistics of solutions of the tadpole equation. 

• First the numerical approach, where one performs for varying L a brute force computer 
search for all solutions to the tadpole conditions. This is the most time consuming 
and less elegant method but it really gives the exact number of solutions. 

• Second, we can use the analytical saddle point function f{q) (|3.33|) and using MATH- 



EMATICA determine numerically the saddle points for varying L. Here only the sum 
over all possible wrapping numbers for just a single stack of branes occurs, which a 
computer can easily handle in 0(1) seconds. We call this approach semi-analytic and 
it will be the most powerful one for the more complicated 6D and 4D models. 
• Third, one can try to estimate the locations of the saddle points and derive an analytic 
expression for Af{L). In the easy example above it was possible to do so, but in more 
complicated examples it gets more and more complicated. We call this approach the 
analytic one. 

In Figure 3 we display the result for the total number of solutions to the tadpole 
cancellation condition for varying number of L. Here we compare the result of the exact 
numerical analysis with the outcome of the second order semi-analytic SPA. 

N(L) 

5r 



3 - 



10 15 20 25 



Fig. 3: Number of solutions for varying L. The dotted line denotes the 
result of the exact analytic computer search and the solid line the outcome 
of the second order SPA. 
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As expected the saddle point curve lies below the exact results, but nevertheless 
captures all of the qualitative features. 



3.4- Probability of SU{M) gauge symmetry 

We are also interested in the percentage of models containing at least one SU{M) 
gauge factor. With the same methods as above this can be written as 



L X ^ L oo 



2mM{C) 

The corresponding saddle point function reads 



(3.37) 



r /I n^^\ 

/(^)^Et^-.o4."(\^) 



-(L + l)logQ. (3.38) 



We can either numerically search for saddle points of this function or we can observe that 
for M <^ L the second, M dependent term in ( p.38|) is just a small perturbation. In this 
case we expect that in leading order the saddle point does not change, so that we have 



only to evaluate ( p.38|) at Qmin = 1 — y ^^^r^- Doing this for large L one expects that the 
probability to find an SU (M) gauge factor scales like 



P{M) ~ exp I -W ) (3.39) 



with the normalisation chosen appropriately. In Figure 4 we have shown the distribution 
of probabilities for at least one SU{M) factor for L=25. 

We notice that the exact and the semi-analytic computation nicely agree. Since we 
are computing a normalized quantity the quantitative agreement is even better than for 
the total number of states. 

Analogously, the probability to find a product gauge symmetry Yli'^Ui^i) 
^^Mi<^L is 

P(M) ^ exp (^-^^Y^M^j , (3.40) 
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-4 
-6 
-8 
-10 

Fig. 4: Probability of at least one SU (M) gauge factor for L=25. The dotted 
line shows the exact computer search result and the solid line the second order 
SPA. 

which shows that in this approximation it only depends on the rank of the gauge symmetry. 
Therefore, in this regime we have mutual independence of the occurrences of SU (M) gauge 
factors, i.e. 

P(M) = J]P(Mi). (3.41) 

i 

This behaviour is expected to break down for L. 

3.5. The average rank 

Another gauge theoretic quantity we can compute is the expectation value of the rank 
of the gauge group in the ensemble of solutions to the tadpole cancellation condition. The 
integral for the saddle point approximation can be simplified and written in the form 

•' ^ ^ k=l Ni = lXi=l Nk = lXk = l \ a J 

" mm^) / ^''P ( t ^ + (e (13^) - (i + 1) log . j . 

(3.42) 

For L = 25 we obtain the curves in Figure 5. 

Apparently, the SPA provides a very good estimate and the average rank scales like 

L/2. 
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<rank> 

15 - 




Fig. 5: The average rank. The dotted hne shows the exact computer search 
result and the sohd hne the second order SPA. 

3.6. The rank distribution 

Next we would like to investigate the probability to get a gauge group of rank r. This 
means that we also have to implement the constraint 



Y,Na = r, (3.43) 



o=l 

which we again do by writing the Kronecker delta as a contour integral. Thus, 

oo L oo L 



^ ^ fc=l ATi^lXi^l Nk=lXk=l 



(3.44) 



with the corresponding saddle point function 



^) = E i_ X -{L + l)logq-{r + l)\ogz. (3.45) 

Numerically determining the saddle point in the two variables q and z we find the Gaussian 
like distribution shown in Figure 6, where we have also displayed the exact numerical result. 
Again the qualitative agreement is good, the exact curve simply reflects the number theory 
behind this distribution. 
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Fig. 6: The rank distribution for L = 25. The dots show the exact results 
and the solid line the distribution coming from the SPA. 

Let us give a heuristic argument for the appearance of the Gaufi curve. We have 
already found that the likelihood for the occurrence of a gauge group of rank r is 
exp(— A/7~^r) with 7 = L/logL. In addition, we have the partition of r ways to dis- 
tribute the Na- Therefore, the probability to find rank r should behave like 

P(r) ~ exp (2y/¥ - 7-^/V) ~ exp (^- ^^ ^ . (3.46) 

Prom this argument we expect the maximum of the distribution to be at Vmax = L/ logL. 
The width of the Gaussian is Ar = \/L/ logL, so that the relative width Ar/rmin = 
^JlogL/ L decreases for larger L. 

Since supersymmetry requires Ya = for all D-branes, in this case the intersection 
number lab between pairs of branes always vanishes. In this respect our simple toy model 
is not general enough, so that now we move forward and generalise the methods developed 
in this section to the more interesting 6D and 4D models. 

3.7. A comment on counting quantum vacua 

In this paper we mainly consider the statistics of classical vacua of intersecting D- 
brane models. In fl^ it was suggested that one should also count quantum vacua, in the 
sense that the number of such vacua of a four-dimensional M = 1 supersymmetric gauge 
theory is given by the Witten index |^^. For a pure SU{N) gauge theory this index is 



given by A^. In this section we would like to briefly discuss what happens when we take 
such an A-fold degenerary into account for our toy model. 
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The number of quantum vacua would be given by 

^ k=l ' 7Vi = lXi = l Nk = lX^: = l \ a / 

^ k=l \X=1 ^ 



(3.47) 



^ k=i \x=i ^ ^ 
so that the saddle point function / reads 

/(?) = i: TT^^ -(L + l) logg. (3.48) 

x=i ^ ^ ' 

Close to Q ~ 1 we find the analytical expression 

1 ^1 TT^ 1 

/ 2\l/3 

In this approximation the saddle point is at qmin = 1 ~ ( fx ) ih.dX the leading order 

saddle point approximation reads 

A/'q(L) ~ exp (^aLt) (3.50) 

with a = 3,^Ji|73 - 1.94. 

In Figure 7 we show the numerical leading order saddle point approximation con- 
firming the scaling ( |3.5C1| ). This admittedly very simplified picture shows that taking the 
quantum degeneracy into account induces a slight change in the exponential growth 

(3.51) 

logAT 



4. Statistics of a 6D model 

In this section, employing analogous methods as developed in the previous one we 
investigate the statistics of the six-dimensional intersecting brane model on the orientifold 
T^/Za- The two main new features as compared to the 8D case are the complex structure 
dependence of the supersymmetry conditions and the possibility of chiral supersymmetric 
models. To begin with, let us show that the model actually admits only a finite number 
of inequivalent solutions to the tadpole cancellation conditions. 
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Fig. 7: The total number of quantum vacua for L = 50. The dots show 
the leading order saddle point approximation and the solid line the curve 
2.2 ■ Ll/loglO. 

4-1. Finiteness of solutions 

Recall the general features of this model mentioned in section 2. Besides these we will 
show in appendix A that the relative coprime property of the wrapping numbers (n/, mj) 
translates into the following relation for the wrapping numbers {Xi,Yi) 



gcd(Xi,y2)gcd(X2,y2) = i^2 



(4.1) 



Here we have assumed non- vanishing Xj and Yj. In configurations with some wrapping 
numbers vanishing we only admit (Xi, X2) G {(0, —1), (1, 0)} with Yi = 1^2 = to exclude 
multiple wrapping. 

In order to decide if the number of stable solutions is finite, we need to extract the 



restrictions on possible choices of Uj imposed by supersymmetry (|2.14[ ) and the tadpole 
cancellation conditions (|2.13| ) . It is clear that the supersymmetry conditions can only have 
non-trivial solutions if U1/U2 G (Q. Now, writing U1/U2 = Ui/u2 for coprime integers 
Wi,W2, we find from (|2.14| ) that Yi = W2 2/1 for some integer yi, in terms of which (|2.12| ) 
reads 

X^X2^-uiU2iyi)^. (4.2) 

It follows that at least one of the Xj has to be non-vanishing, in which case the second 
supersymmetry condition becomes vacuous once ( |2.13| ) and ( ^.14| ) are satisfied. Note 
furthermore that due to the reflection symmetry of the wrapping numbers we will stick in 
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the following to the case that Xa^i > and change X2 —X2 so that all Xj are now 
positive. 

To summarise, we have to analyse all possible solutions to the equations 

k 

J2NaXa,l=Li, 

a=l 
k 

Y,NaXa,2=L2 

a=l 

for non-negative integers (Xa,i, Xa,2) 7^ (0,0) such that 

a:=./?SiZ^eZ (4.4) 

and 

gcd(Xa,i, Ul a) gcd(Xa,2, Ul a) = ui a. (4.5) 

This means that for fixed complex structures and bounds Li and L2 the number of su- 
persymmetric solutions to the tadpole cancellation conditions is finite. We denote the set 
of wrapping numbers Xi, X2 satisfying ([4.4| ) for fixed complex structure as Su- Note that 
we allow for different stacks having the same wrapping numbers, which means that the 
branes do have open string moduli corresponding to the position and continuous Wilson 
lines of the branes. 

It remains to be seen that also only a finite number of complex structures are allowed. 
In this respect we realise that for given Li, L2 solutions with at least one stack of branes 
where Xi > and X2 > are possible only for those complex structures satisfying 

Ul U2 < Li L2. (4.6) 

In those cases, the factorisability of the branes and super symmetry restrict the admissible 
ratios of complex structures to only a finite number of possibilities. Of course, we can still 
rescale Ui and U2 by the same factor, which implies that in these models only one of the 
two complex structure moduli is fixed and one is left as a free parameter. 

From what we said, it is appropriate to distinguish two classes of models. In the 
first class we have ui U2 < Li L2 and both branes satisfying ( Xi > and X2 > 0) and 
branes satisfying {Xi = or X2 = 0) are present. If a solution to the tadpole cancellation 
condition contains at least one brane with ( Xi > and X2 > 0), then one complex 
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4.3 



structure modulus is fixed, namely Ui/U2- If the solution only contains branes of the 
other type, then both complex structures are free parameters. The second class consists 
of configurations where ui > Li L2 and only branes satisfying {Xi = or X2 = 0) 
are present. For coprime wrapping numbers these are only two branes. Apparently, all 
the solutions to the tadpole cancellation conditions we can get are already contained in 
the first class, so that we can dismiss the second class. The upshot of this discussion is 
that, if wc count solutions to the tadpole cancellation conditions modulo moduli space 
identifications, then the overall number of solutions to the tadpole cancellation conditions 
is finite. 

Given the supersymmetry constraints above, one can simplify further the intersection 
number 

lab = -2 {Xa,l Xt,2 + Xa,2 ^6,1 + ^a,! ^,2 + Ya,2 ^,1) (4-7) 

between two 3-cycles. After some little algebra one arrives at the expressions 

lab = —2 (^■\/X^~iX^ — \/X^^^2X^^ , 
la'b = —2 ^\/ Xa,i Xi,^2 + \/ -^a,2 -^6,l) ' 



(4.8) 



which immediately imply = and la' a = —8Xa,i Xa^2- Apparently, the intersection 
number is negative definite. This is well in accord with the expectation from J\f = 1 
supersymmetry in six dimensions, namely that all bifundamental matter fields transform 
in hypermultiplets and not in vectormultiplets. For the intersection of the orientifold plane 
with a D-brane we get 

IOa = -^{Xa,l+Xa,2). (4.9) 

4.2. Counting tadpole solutions 

Now we proceed completely analogously to the 8D example and first compute the total 
number of solutions to the tadpole cancellation conditions for fixed complex structures, 
Ui,U2- Again in the occurring integrals the sums over the numbers of branes Na can be 
carried out explicitly and yield geometric series. One can write the final result as 



Af{Li, L2) ~ ttAto / dqi dq2 exp( V - 



- (Li + 1) log qi - (L2 + 1) log q2 
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Fig. 8: The total number of models for L2 = 8, varying Li and {ui = 1,U2 = 
1). The upper dots show the distribution with multiple wrapping numbers 
allowed and the lower stared line the case with coprime wrapping numbers. 

The asymptotic growth of this expression can be deduced by the saddle point approxima- 
tion now in two variables qi,q2- In Figure 8 we show the result of a numerical evaluation 
of this saddle point for the case of L2 = 8 and varying Li. As expected the curve for 
coprime wrapping numbers lies under the line with multiple wrappings allowed. 

For L2 = 16 we get the result shown in Figure 9. Quite surprisingly for fixed L2 this 
still scales like ex.p{y/Li logLi) so that the 8D toy model gave already a good impression 
of how the situation is for more realistic models. 
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Fig. 9: The total number of models for L2 = 16 and {ui = 1, tt2 = 1) 



For the case of coprime wrapping numbers we have also carried out a brute force 
computer search for all solutions to the tadpole conditions. The resulting curve for the 
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total number of solutions is plotted in Figure 10. As expected from the too strong l/k\ 
suppression, the saddle point approximation lies below the exact curve. However, qualita- 
tively we find the same scaling and if we multiply the saddle point result by the factor 1.28 
found for the number of partitions in section 3.1, we indeed find quite good agreement. 
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Fig. 10: The total number of models for L2 = 8 and {ui = 1,^2 = 1). 
The dotted line is the result of the exact computer search, the solid line 
the prediction from the second order SPA and the dashed line the latter 
distribution scaled with the factor 1.28. 



Ji..3. Probability of SU{M) gauge symmetry 

Next for fixed Lj we investigate the probability to find at least one SU{M) gauge 
factor, which leads to the expression 

+ log ( Yl ^^""^ ]-{Li + l) log gi - (L2 + 1) log g2 

(4.11) 

In Figure 11 we show the resulting distribution for Li = 8 and L2 = 8 and realise that as 
in the 8D case it still decreases exponentially with M. 

For the choice Li = 16 and L2 = 16 the distribution looks like shown in Figure 12. For 
- Mi <^ Li + L2 we again find that the occurrence of a gauge factor SU {Mi) is given 
by the product of the probabilities for the occurrence of each factor, P{M) = Yii P{Mi). 
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Fig. 11: The probability for finding at least one SU{M) gauge factor for 
Li = L2 = 8 and complex structure {ui = l,tt2 = 1). The dotted line is the 
result of the exact computer search for coprime wrapping numbers, which has 
to be compared with the solid line showing the second order SPA. 
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Fig. 12: The probability for finding at least one SU{M) gauge factor for 
Li = L2 = 16. The dotted line is the SPA for non-coprime wrapping, the 
stared line gives the result for coprime wrapping numbers. 



4-4 ■ The average rank 



In this section we would like to compare the expectation value of the rank of the gauge 
group for fixed L2 and varying Li as computed from the exact computer search and the 
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second order SPA. The integral for the latter can be simplified and written in the form 



1 - Qi' Q2' 



+ E TT^SfS^ -(^i + i)ioggi-(i^2 + i)iogg. . 

(4.12) 

As in the 8D case, the average rank is approximately at (Li + L2)/2. For L2 = 8 and 
(■Ui,tt2) = (1,1) we have obtained the two curves in Figure 13. Apparently, the SPA 
provides a very good estimate. 
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Fig. 13: The average rank for coprime wrapping numbers with fixed L2 = 8 
and (ui,U2) = (l^l)- The dots show the exact computer search result and 
the solid line the second order SPA. 



4.-5. The rank distribution 

The likelihood to find a gauge group of rank r = Na can be determined from the 
integral 



P(r,Lj) 



// ^ ^ qXi X2 
dqi dq2 dz exp V ^ ~ ("^i + ^) ^1 



j\^(Li,L2)(27rz)3 



(L2 + 1) log g2 - (r + 1) log 2; ) . 



(4.13) 

In Figure 14 we show the distribution of the rank of the gauge group for coprime wrapping 
numbers both for the exact computation and the second order SPA. We see that the exact 
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Fig. 14: The rank distribution for Li = 10, L2 = 8 and (wi, U2) = (1, 1) and 
coprime wrapping numbers. The dots are the exact results and the sohd hne 
the SPA. 

values scatter around the saddle point result. Since we are not really at large values of the 
L/, this is to be expected. The exact results still show number theoretic deviations from 
the probably smooth asymptotic behaviour. 

Figure 15 displays the saddle point result for the distribution of the rank of the 
gauge group both for coprime and non-coprime wrapping numbers and L/ = 8. For 
non-coprime wrapping numbers, the distribution again is Gaussian, where the maximum 
lies approximately at (Li + L2)/2. For coprime wrapping numbers the distribution also 
looks Gaussian, where however the maximum has been shifted towards larger values of the 
rank. This can be understood from the fact that for large values of the wrapping numbers 
(Xi,X2) the coprime condition becomes harder to satisfy. Large values of (Xi,X2) are 
therefore suppressed, which due to the tadpole cancellation conditions means that smaller 
values of Na and therefore of the rank appear less frequently. 

Let us also discuss what happens for other values of the complex structure moduli. 
One example for coprime wrapping numbers is shown in Figure 16, where we compare for 
Li = L2 = 16 the distribution for {uijU2) = (1,1) and (wi,W2) = (20,1). Apparently, 
for large complex structures larger values of the rank are preferred and the nice Gaussian 
shape of the distribution is lost or rather the Gauss curve has been shifted so much to 
large values of r that only part of the left wing is visible. 
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Fig. 15: The rank distribution for Li — L2 — 8 and (wi, W2) = (1, !)• Dots: 
multiple wrapping, stars: coprime wrapping numbers. 
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Fig. 16: Comparison of rank distributions for complex structures [ui, U2) = 
(1, 1) (empty stars) and (ui,U2) = (20, 1) (filled stars) in the case of coprime 
wrappings and Li — L2 = 16. 

4.6. Sum over complex structures 

So far we have just evaluated the various distributions for the case that the Li and 
complex structures are fixed parameters. Even though for concrete applications the Lj 
are indeed fixed, the complex structures are not. The values at which they arc frozen by 
the supersymmetry constraints depend on the concrete solution of the tadpole cancellation 
conditions. In Figure 17 we show the allowed values of the coprime numbers {ui,U2)- 

It is now an interesting question whether the summation over all these complex struc- 
tures destroys the qualitative features we have seen for instance for the distribution of the 
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Fig. 17: Allowed values of complex structures for Li = L2 = 8 

rank of the gauge group. It could very well be that any distinctive feature gets lost and 
one eventually obtains a uniform distribution. We have carried out such an additional 
average for the case L/ = 8 with the result shown in Figure 18. Apparently, we still get 
shifted Gauss like curves, which shows that the average over the complex structures is far 
from leading to a uniform distribution. 
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Fig. 18: Total distribution of the rank of the gauge group after adding up 
all possible complex structures. We choose Li = L2 = 8. Dots: multiple 
wrapping, stars: coprime wrapping numbers. 

Jf..!. The chirality distribution 

In order to proceed, one has to choose a good measure for the chirality in the system. 
One possibility would be to compute the distribution of intersection numbers lab between 
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different stacks of branes. This is precisely what we wiU propose in the 4D case, but since 
the 6D case is not of phenomenological importance we choose a simpler quantity involving 
only one set of branes like la' a = —8X^,1 Xa,2- This index counts the net number of chiral 
fermions in the anti-symmetric plus symmetric representation of the gauge group. Since 
this index only depends on the wrapping numbers, the constraint Xa,iXa,2 = X can be 
easily implemented in the saddle point integrals. The sum over one stack is restricted to 
those wrapping numbers Su,x satisfying the constraint. 

For the distribution of this intersection number in the ensemble of 6D intersecting 
brane models with fixed complex structure and Lj one obtains 

1 




(4.14) 



it is clear that for (mi,U2) = (1, 1) only those 



- (Li + 1) logQi - 

Due to the supersymmetry constraint 
X appear which can be written as a square x = a^. In Figure 19 we show the resulting 
distribution for Lj = 8. It is obvious that small values for the chirality index are favoured. 
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Fig. 19: The chirality distributions for L/ = 8 and (^1,^2) = (1, !)■ Dots: 
multiple wrapping, stars: coprime wrapping numbers. 



To get a better impression about the scaling, we show for the choice Lj = 16 and 
(wi, U2) = (1, 1) a logarithmic plot for non-coprime wrapping numbers in Figure 20. From 
there we deduce that the scaling is roughly like P{x) — exp(— k^/x), which could be ex- 
pected as the wrapping numbers X/ „ and the numbers of branes A^^ appear symmetrically 
in the tadpole cancellation conditions. 
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Fig. 20: The chirality distributions for L/ = 16, {ui,U2) = (1,1) and non- 
coprime wrapping numbers. 

4-8. Statistical correlations 

To conclude our discussion of the statistics of the 6D model, we compute the distri- 
bution of the rank of the gauge group and the chirality index at the same time. We are 
particularly interested in searching for a statistical correlation between these two quanti- 
ties, as expected from the tadpole cancellation conditions, which involve the Na related to 
the rank and the Xa related to %. The distribution is given by 



- (Li + 1) log gi - (L2 + 1) log q2- {r + 1) log z 

Evaluating this threefold integral via a numerical saddle point approximation results in 
the plot shown in figure 21. 

From this plot one can already guess that the maximum of r for fixed x depends on 
X- In fact it appears that for larger x the r distribution moves towards smaller values 
of the rank. Since the rank only depends on the various numbers of D-branes and the 
chirality only on the wrapping numbers, the tadpole cancellation conditions imply that 
these numbers are not independent. In fact larger values of x mean smaller values of 
the Na- What we see in the figure is the statistical manifestation of this string theoretic 
constraint. By looking at the plot from the another direction this correlation is even more 
obvious. 
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Fig. 21: Rank-chirality correlation for non-coprime wrapping numbers, L/ = 
8 and (wi, U2) = (1, 1). 
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Fig. 22: Rank-chirality correlation 

We can also directly compute how for fixed values of the chirality x the position of 
the maximum of r) shifts towards smaller values of r. The result is shown in Figure 
23. 

We conclude from this 6D intersecting D-brane model that the stringy tadpole can- 
cellation conditions imply a statistical correlation of the rank of the gauge group and its 
chiral matter content. 

5. Statistics of a 4D model 

Now we come to the physically most interesting case of a four dimensional model. In 
this section we first proceed completely analogously to the higher dimensional examples 
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Fig. 23: Shift of the maximum in r for different values of x- 

and discuss the various distributions for essentiaUy fixed Lj. However, it turned out 
that some computations already become quite time consuming so that we postpone their 
presentation to a future publication |Q. Instead, in section 6 we will combine the gauge 
theory sector in the T-dual picture with a flux compactification and perform the statistics 
taking both sectors into account. 

We now consider intersecting branes on the j'S.i x ^2 orbifold without discrete 
torsion [^. As compared to the 6D case, this model shows some new features, e.g. the 
possibility of negative contributions to the tadpole cancellation conditions from supersym- 
metric branes. Therefore, it seems less obvious that also for fixed complex structure only 
a finite number of solutions exists. We will argue in the next section that this is indeed 
the case. The reader who is not so interested in these technical details may jump directly 
to section 5.2. 

5.1. Finiteness of solutions 

After recalling the definitions of the model from section 2.2, we would like to add 
that the coprime condition on the wrapping numbers (nj, mj) translates into the following 
condition on the 3-cycle wrapping numbers 



The proof for this can be found in Appendix B. Now we would like to evaluate the various 
conditions the wrapping numbers [Xj, Yj) have to satisfy in order to see whether finitely 
many solutions exist. 



3 





7=1 
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First, we want to write the expressions more symmetrically and transform {Xj, Yj) 
[—Xi,—Yi) for / = 1,2,3. Moreover we define Uq = 1, which allows us to write the 
supersymmetry conditions as 

^ 1 

0, 



_ (5.2) 
I 

whereas the general consistency conditions are 

Xi Yi = Xj Yj for aU /, J, 
XiXj = -YkYl, 

(5.3) 

Xl (Yl) = —Xj Xj Xk, 

Yl (Xl^ = -Yi YjYk for all / ^ J ^ K ^ L ^ /. 
Multiplying each of the four tadpole equations 



hj Ui and adding them all up yields together with ( p7^ ) the constraint 

3 

Q <Y^XiUi <^LiUi. (5.5) 

7=0 I 

Now let us distinguish the three cases that either one, two or four of the Xj are non- 
vanishing. The case that only one Xi vanishes is excluded by ( ^.3|) . 

For the case that only one Xj is non-vanishing, the brane lies precisely on top of one 
of the four orientifold planes and supersymmetry imposes no constraint on the complex 
structures. 

In the case that two Xi are non-vanishing, we are exactly in the situation we discussed 
for the 6D model. Supersymmetry implies that the Xi > so that the contributions to 
the tadpole conditions are non-negative. In addition, one of the three complex structures 
is fixed at a rational value, whereas the other two remain as free parameters. 



Some new aspects appear for the case of all Xj no n- vanishing. The constraints (5.3) 
imply immediately that an odd number of the Xj have to be negative. 

If three < and, say, Xa > 0, then the supersymmetry condition can be written 



as 



Ui Ua\ V x,u, 
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which imphes Ui < —Xa Ua for all i. However, this is in contradiction with the other 
supersymmetry condition 

so that we conclude that no super symmetric branes with three Xi < exist. 

What remains is the case that three Xi > and one Xa < 0. In this case the same 
argument as above yields Xi Ui > —Xa Ua and does not give any contradiction. Note that 
the supersymmetry condition ( |5.6| ) allows us to express Xa in terms of the positive Xi 

Xa = -^ru^- (5-8) 



The upper bound in (|5.5|) can be further evaluated as 



J2LiUi> Xa Ua + Uj >X,U^>0 (5.9) 

I j 

for any of the three possible i, where we have used the relation Xi Ui > ~Xa Ua- 

As in the 6D case, we can argue that if we take a sufficient number of these branes, 
then the complex structures are fixed at rational values. So let us write Uj = ui^2/ui^i 
with ui^2iUi^i G IN and relatively coprime. Then the constraint (|0|) can be expressed as 



ioY P ^ Q ^ R ^ S ^ P and i^J^K^L^i. Thus, we conclude that for fixed 
complex structures only a finite number of branes is admissible. It is much harder to see 
that for fixed L/ also only a finite number of complex structures allows any solution to the 
constraint ( |5.10| ). A computer analysis however suggests that this is indeed the case. If this 
is really true, then analogously to the 6D case, we would argue that modulo moduli space 
identifications, for fixed Lj there exists only a finite number of solutions to the tadpole 
cancellation conditions. 
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5.2. Counting tadpole solutions 

Now we proceed completely along the lines of the higher dimensional examples and 
compute the total number of solutions to the tadpole cancellation conditions for fixed 
complex structures t/i, 1^3- Let us denote by Su the total set of the three classes of 
super symmetric branes described in the last section. By now we are quite familiar with 
the integrals appearing so that we can present them without too many comments, 

M{Li)'^-^—^ldqodqidq2dqze^^( V ^^^^ ^ - y](Lj + 1) loggj) . (5.II) 

The asymptotic growth of this expression can be deduced by the SPA now in the four 
variables qo, qi, q2T 13- In Figure 24 we have shown the result of a numerical evaluation of 
this saddle point. 
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Fig. 24: The total number of solutions for Li = L2 = I/3 = 8. Dots: 
multiple wrapping, stars: coprime wrapping numbers. 

As compared to the 8D and 4D models, a new feature appears here which is rooted in 
the possibility to have supersymmetric D-brane models even for the case Lq < 0. Looking 
at Figure 24, we observe that the curve for Lq > still scales like exp(^Li logLi), even 
though one could have guessed that the new class of branes with negative Xj somehow 
changes the behaviour. However, for the case Lq < these latter branes become relevant, 
as without at least one of them no solution to the tadpole cancellation conditions can exist. 
However, these branes have non-trivial winding on all three T^s, so that they are rather 
long leaving little room for other branes. Therefore, we expect that for Lq < the number 
of solutions decreases dramatically as can be nicely seen in Figure 24. 
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5.3. Probability of SU{M) gauge symmetry 

For fixed L/ the probabifity to find an SU (M) gauge factor is given by 



\Xi^Su \ I J J I 



(5.12) 



(L7 + l)log9/ . 



In Figure 25 we show the resulting distribution for L/ = 4V/ and reahze that it also 
decreases exponentially for larger M. 
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Fig. 25: The probability for finding at least one SU(M) gauge factor for L/ = 
4, V/. The result for non-coprime/coprime wrapping numbers is represented 
by the dots/stars. 

For Mi <S Ylii we again find that the occurrence of a gauge factor YYi SU (Mi) 
is given by the product of the probabilities for the occurrence of each factor, P{M) = 
UiPiMi). 

For realising the Standard Model with four intersecting branes, we learn that an upper 
bound of their probability is Psm = -P(3)-P(2)P(1)P(1). Of course, more conditions arise 
when we require that the hypercharge U{1)y remains massless after taking the Green- 
Schwarz mechanism into account. 



5.4- The rank distribution 

The likelihood to find a gauge group of rank r 
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= can be determined from the 



integral 



-^(Lj + l)log?/- (r + l)log2;y 

In Figure 26 we show the resulting rank distribution ioz Lq = Lx — Li = L3 
and both non-coprime and coprime wrapping numbers. 



(5.13) 
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Fig. 26: The rank distribution for Lq = Li = L2 = L3 = 8, C// = 1 and 
both non-coprime (dots) and coprime (stars) wrapping numbers. 

Since it is interesting to see what happens on the new branch Lq < 0, in Figure 27 
we displayed the rank distributions for Li = L2 = L3 = 8 and the choices Lq = 8 and 
= -2. 

For > the distribution still has the familiar Gaussian shape, where the maximum 
lies approximately at Lj) /2. For Lq < the shape of the curve is still Gaussian, but 
the maximum is at a much smaller value than r = (^j L/)/2. This can be understood by 
the fact that in this case the special set of supersymmetric branes with some Xa < has 
to be present. Since these branes are rather long, one expects the rank of the gauge group 
to be reduced. 

5.5. The chirality distribution 

Recall that in 6D we defined a measure for the chirality in the model by the number 
of symmetric plus antisymmetric representations of a gauge factor. Here, to be closer to 
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Fig. 27: The rank distribution for Li = L2 = Ls = 8, Uj = 1 and non- 
coprime wrapping numbers. The dotted curve is for Lq = —2 and the one 
with triangles for Lq — 8. 

the question of the probabihty of a three generation model, it is more appropriate to take 
the intersection number lab between two different stacks of D-branes. More accurately we 
should take the number 

X^Ia'b-Iab = 2YaXb, (5.14) 

which in our case of rectangular tori is always even, but can become odd for tilted ones 
i5[] . For the distribution of this intersection number in the ensemble of 4D intersecting 



brane models with fixed complex structure and L/ one obtains 



1 



'AfiLi) {2ni) 



I) exp 



n^a,I 



J2iLi + l)\ogqi 



(5.15) 

Our results for the chirality distribution displayed in Figure 28 clearly exhibit number 
theoretical effects in that the values for prime ^ tend to scatter around the approximate 
straight line formed by the other dots. Although much weaker, this effect can already be 
observed in the corresponding 6D chirality distribution in Figure 20 and merely reflects 
the special factorization properties of prime numbers. 
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Fig. 28: The chirality distribution for Lq = Li — L2 — = 8, Uj = 1 and 
multiple wrapping. 

5.6. Statistical correlations 

Finally, the statistical correlation between the rank of the gauge group and the chirality 
can be computed as 



2iog( V ^Hi^j+iogl V _tMm.^ — turn 

^ (L/ + 1 ) log 97 - (r + 1) log 2; 



(5.16) 

As can be seen from Figure 29, the maximum of the rank distribution takes smaller values 
for increasing chirality, in agreement with the naive expectation from the tadpole cancel- 
lation conditions. 



6. Combination with flux compactification 

So far we have only discussed the case where the gauge theory sector of intersecting 
branes is completely decoupled from the flux sector. That means that in a concrete setting 
the non D-brane contributions L/ to the tadpole cancellation conditions are fixed by the 
orientifold planes. However, it is known that three-from fluxes in the Type IIB T-dual 
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Fig. 29: Rank-chirality correlation for Lq = Li = L2 = Ls = 8, Uj = 1 and 
multiple wrapping. 



models also give a positive contribution to the 4-form tadpole condition. Therefore, the 
effective non D-brane contribution to the 4-form tadpole is Lq — Njiux- To avoid confusion 
in the following, please note that the former complex structure moduli in the T-dual model 
become Kahler moduli and vice versa. 

In this section we would like to get a rough insight into the statistical implications in 
the gauge theory sector if we also take the degeneration of flux vacua into account. In 



a formula for the number of flux vacua for given Nfi^x < L* was derived, which had the 
peculiar scaling 

Ar~(L*)^, (6.1) 

where K is the number of three-cycles. This scaling behaviour is a good approximation as 
long as L* ^ K, in which case the discrete sums could be estimated by continuous integrals 



|2|,|6|. In the case we are discussing here, we have L* < 8 and K ~ 10, if we only allow 
for bulk fluxes, and K ~ 100, if we consider twisted 3-form fluxes, too. Therefore, we are 
not really in the regime where we can trust the scaling ( |6.1| ). Nevertheless, as shown in 



one can still flnd polynomial scalings for L* ~ so that we use ( |6.1|) as a rough estimate 
of what one can expect. In fact, we assume that the derivative of (|6.1|) gives the number of 
flux vacua for flxed N = L*. Note that the aim here is only to get a flrst glimpse of what 
can happen when one combines the flux statistics with the D-brane statistics. Note also 
that Nfiux is bounded from above by the requirement that the conditions ( p.9| ) admit any 
non-trivial solution. Only branes with all Xi 7^ can compensate a negative Lq — Nfiux- 
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In the following, we leave the precise value of K in (|6.1j ) open and check how the 
various gauge sector distributions change depending on the value of K. To determine the 
statistics, we now have to sum over all possible values of Nfiux and weight each term with 
the degeneracy (|6.1|) . Since we expect the distributions not to depend strongly on the 
Type IIA complex structure moduli (Type IIB Kahler moduli), we are just choosing the 
specific value Uj = 1 for simplicity. For the rank distribution for instance we now get the 
following expression 



P(r) 



flux 



{Njiux + 1)^ M{r- Lo - Nfiux, Li, L2, L3), 



(6.2) 



where A/'(r, Lj) is just the unnormalised part of the distribution (|5.13|) and Nnorm the 
new normalization constant. We introduced the factor {Nfi^x + 1) so that Nfi^x = also 
contributes non-trivially. In figure 30 we show the resulting distribution for the ^2 x Z2 
orientifold. 
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Fig. 30: The rank distribution after averaging over flux vacua for Lq = Li 
L2 = L3 = 8, t/7 = 1 and NJl^^ = 11. 
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Fig. 31: The rank distribution after averaging over flux vacua for Lq = Li = 
L2 = L3 = 8, C/j = 1 and NJl^^ = 11. 

From another angle the same distribution looks like shown in Figure 31. 

For a large range of values for K, one flnds essentially the same rank distribution as 
for flxed Lq, a Gauss curve with a maximum at around r^ax — 16. However for values 
K > 70 new maxima appear for small values of r, which come from the fact that in this 
case the large number of flux vacua dominates the statistics and gives a larger weight to 
the actual few brane solutions with Lq < 0. Maybe it has no actual signiflcance, but let us 



observe that the semi-realistic flux vacua constructed in [^2[ belong precisely to this latter 
regime. 

Naively, one could have expected that at least for certain values of K one obtains a 
uniform distribution for the rank of the gauge group, but this does not seem to be the 
case. This flrst estimate of the influence of the flux sector on the distribution of gauge 
theoretic observables seems to tell us that even after averaging over the flux vacua one still 
gets non-trivial distributions of the gauge theory observables. 

49 



Of course here we have just gained a first ghmpse and it would be interesting to 
statisticaUy analyse the efi'ect on other gauge theory observables and in particular on 
those, which depend on the Type IIB complex structure moduli and therefore on the flux 



side of the model. More detailed physical quantities like Yukawa couplings or soft 
supersymmetry breaking terms [03 fall into this category. 



7. Conclusions 

In this paper we have investigated the statistical behaviour of solutions to the stringy 
tadpole cancellation conditions as they appear for intersecting D-brane models. Realizing 
that the problem of counting just solutions is similar in spirit to the evaluation of the 
asymptotic growth of the number of partitions, we have developed similar methods for 
counting solutions to tadpole equations. These methods are based on the saddle point 
approximation of the naturally occurring integrals and could be generalised to compute 
more sophisticated distributions of physical quantities. We have demonstrated for a simple 
enough 8D toy model that this approximate method is indeed in good agreement with an 
exact numerical computation of all solutions. Even when the absolute values differ by, say, 
one order of magnitude, the leading order saddle point approximation provides already a 
very good picture of the qualitative features of the statistical distributions. 

Encouraged by these observations we moved forward and applied similar methods to 
two concrete intersecting D-brane models in 6D and 4D. As a byproduct of our considera- 
tions we proved and, respectively, gave very strong evidence that the number of solutions 
of the tadpole cancellation conditions in the class of special Lagrangian cycles is flnite, 
even allowing varying complex structures. We found that many of the qualitative features 
appearing in physically more interesting 6D and 4D examples were already visible in the 
8D toy model. Intriguingly, in the former models we observed a statistical correlation 
between the number of families and the rank of the gauge group. This confirms that string 
theory as a constrained system can indeed give rise to non-trivial statistical correlations 
between physically relevant and measurable quantities. As was pointed out, such corre- 
lations might eventually lead to a statistical falsification of string theory or at least to 
statistical support for its relevance in nature. Since we are just beginning to explore the 
string theory landscape and develop mathematical methods for this purpose, it is way too 
early to make any definite statement. 
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We are very optimistic that the general methods developed here can also be applied 
to other statistical problems on the string theory landscape like counting of other types 
of string theory vacua. The moment one arrives at similar constraints for integer valued 
quantities which admit a large amount of solutions, these methods should work. We have 
in mind for instance counting heterotic vacua like the subclass of toroidal orbifolds with 
Wilson lines or the statistical analysis of Gepner model orientifolds . In the latter case 
a systematic computer search for semi-realistic models has been pioneered in . 

Of course eventually one has to perform the statistics over the ensemble of all string 
vacua and the various techniques developed for certain aspects such as counting of flux 
vacua and counting of tadpole solutions must be combined. We got a flrst glimpse of 
what can happen in this case in section 6. Many ways to improve our understanding of 
the statistics on the string theory landscape lie ahead and we are very curious to which 
conclusions they will flnally lead us. Can we ever falsify string theory or provide at least 
strong statistical evidence for it or do we have to face the unsatisfactory conclusion that 
we will never know? But even if we can gain statistical evidence, can we then move 
beyond this approach and really flnd at least some of the realistic string vacua, which 
from the phenomenological point of view would still be quite valuable, or do we have to 
face the also depressing scenario that - in the flnite amount of time their species exists - 
the best physicists can learn about the underlying structure of the universe, are statistical 
correlations? 
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Appendix A. Coprime wrapping numbers in 6D 

The quantities Xj,Yi are defined in (|2.11|) and satisfy tlie relation ( p.l2| ). In tliis 
appendix we want to sliow fiow to recover tfie wrapping numbers from given Xj^Yj e Z 
and how to implement the coprime condition which the wrapping numbers should fulfil. 

Of course we have the problem that not all chosen values for Xi,Yj G Z with 
Xi X2 = Yi Y2 correspond to configurations with coprime wrapping numbers, e.g. take 
Xi = —X2 = Y2 = —Yi = 2. First we search for resolutions for non-coprime wrapping 
numbers (nj, mj) = aj {pi, qj) with pi, qj coprime. Defining a = aia2 we can write 

api = gcd(Xi, Yi), 
ap2 = gcd(Xi, Y2), 
aqi = gcd(X2, 12), 
aq2 = gcd(X2, Yi, ) 
and the coprime numbers are given by 



(A.l) 



Pi 

Q2 

while the global prefactor a can be written as 



Xi 




gcd{X,,Y2) 


gcd(X2,yi) 
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(A.2) 



a 



gcd(Xi,y2)g 


cd(X2,y2) 


Y2 




gcd(Xi,y2)g 


cd(X2,yi) 


X, 




gcd(X2,Y-i)g 


cd(Xi,yi) 


Yi 




gcd(X2,y2)g 


cd(X2,yi) 



(A.3) 



X2 

From ( [A.2|) and ( |A.3|) one sees, that all pi,qi,a have exactly one expression in terms of 
Xi, X2, Y2. By construction, pi, qj are integers, upon Yi = ^yf^ G Z one can also easily 
see that the formulae above give an integer a. 

To summarize, given integers Xi,X2,Y2 with ^y^^ G Z can be rewritten in terms 
of coprime wrapping numbers pi, qj and a global prefactor a. Now, the condition for no 
multiple wrappings reads ct = 1, from which we recover ( [4.1| ). 
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Appendix B. Coprime wrapping numbers in 4D 

With the definition from section 2, we first have to recover the coprime integer wrap- 
ping numbers from Xj^Yj. We start as in the 6D case with non-coprime ones. Now we 
search for resolutions for non-coprime wrapping numbers (n/, mi) = aj [pi, qi) with pi, qj 
coprime and define a = 010203. Then we have for I J K ^ I , I , J, K E {1, 2, 3} 

apipj = gcd(XQ, Yk), 

oiqiqj = gcd(Yo,XK), (B.l) 



and thereby 



Pk 

QK 



api qj = gcd(Xj,y7), 
Xo Yj Xk 



gcd{Xo,YK) gcdiXi.Yj) gcdiYo.XK) 
Yo Xi Yk 



gcdiYo.XK) gcdiXi.Yj) gcdiXo.YK)' 
and several relations of the form 

Xk Xo Yj 
apK = = = , 

qi qj pi pj qi pj 

Yk Yo Xj 
aqK = = = , 

piPj qiqj piqj 

which lead to the expression in terms of only Xo, Yq, Xk 



a 

1=1 



(B.2) 



(B.3) 



Xonffi^^^ngcd(yo,X.). (B.4) 



The final condition for no multiple wrappings reads a = 1. This gives equation (|5.1| ). 



Appendix C. Summary of the computational technique 

In this section we would like to briefiy summarise the main computational technique 
we used to determine the distribution of various gauge theoretic observables in the ensemble 
of intersecting D-brane models for fixed geometric background. 

The first step is to determine all or at least a large, preferably representative subset of 
supersymmetric branes. After solving the supersymmetry constraints, in all our examples 
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this was given by a subset S of the naively allowed wrapping numbers Xj. Then the total 
number of solutions to the tadpole cancellation conditions 

k 

Y,NaXa,I = Li (C.l) 

0=1 

with / = 1, . . . , 63/2 is given by the expression 

[2m) 2 J J Qj \xjes ^ " 11/^/ / 

which can be evaluated at leading order by a saddle point approximation with 

n, «f ' 



Xi£S^ i^i-l'il I 



qi. (C.3) 



The saddle point is determined by the condition V/(q) I^-q = 0, and the second order saddle 
point approximation reads 

A^'''W = — ^ , . (C.4) 
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An observable O in this example is given by a function 0(N, X/), where here N and 
X/ denote vectors with respect to the number of stacks. The expectation value of this 
observable in our ensemble is defined as 



(o(N.x,))(i:). / n-g^r E - E E - E 

(C.5) 

Depending on the actual form of 0(N, X/) these sums can be further simplified and in the 
best case reduced to just a few sums over 5'. The final expression is then to be evaluated 
using a saddle point approximation. 

Similarly the distribution of C(N, X/) = r is given by the expression 

^' " ^rf 1^.. /n -fe E F E - E E - E 

TV (L) (27rz) 2 +1 J J \qj )z «. ^^^^ iv,=iXi,,€S x,,,€S 



(C.6) 
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Again simplifications can occur for certain choices of 0{N, X/), which in effect also simplify 
the saddle point approximation. 

One can easily derive further generalisations of these expressions, which in one way or 
the other will be appropriate to study similar statistical questions concerned with counting 
solutions to discrete stringy consistency conditions. 
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